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1.  Introduction 


A  set  of  observations  under  a  design  Is  said  to  be  Influential  In 
this  paper  if  the  set  affects  not  only  the  fitting  of  the  node!  to  the 
data  but  also  the  prediction  in  terms  of  the  fitted  model.  In  the 
problem  of  identifying  sets  of  t  (a  positive  integer)  influential 
observations,  we  assume  the  underlying  design  is  robust  against  the 
unavailability  of  any  t  observations  [Ghosh  (1979)].  We  first  explain 
this  concept  by  considering  the  standard  linear  model 


E(Z)  =  XJ  , 

(1) 

V(y)  =  o2I  , 

(2) 

Rank  X  =  p  , 

(3) 

where  y(Nxl)  is  a  vector  of  observations,  X(Nxp)  is  a  known  matrix, 

B(pxl)  is  a  vector  of  fixed  unknown  parameters  and  a  is  a  constant  which 

may  or  may  not  be  known.  Let  d  be  the  underlying  design  corresponding  to 

The  design  d  is  assumed  to  be  robust  against  the  unavailability  of 

any  t  observations  in  the  sense  that  the  parameters  in  8  are  still  un- 

biasedly  estimable  when  any  t  observations  in  £  are  unavailable.  There 
r  N 

are  possible  sets  of  t  observations.  The  idea  of  robustness  of 
designs  against  unavailability  of  data  is  fundamental  in  measuring  the 
influence  of  a  set  of  observations. 

We  first  measure  the  influence  of  a  set  of  t  observations  by 
assuming  the  observations  in  the  set  unavailable  and  then  calculating  the 
sum  of  variances  of  their  predicted  values  from  the  remaining  (N-t) 
observations.  The  largest  value  of  the  sura  indicates  the  corresponding 
set  of  t  Observations  is  the  most  influential  in  terms  of  precise 
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prediction  of  unavailable  observations.  We  also  measure  the  influence  of 
a  set  of  t  observations  by  assuming  them  unavailable  and  then  calculating 
the  sum  of  squares  of  the  elements  of  the  covariance  matrix  between  the 
least  squares  fitted  values  of  the  remaining  (N-t)  observations  and  a 
complete  set  of  orthonormal  linear  functions  of  with  zero  expecta¬ 
tions.  The  largest  value  of  the  sum  of  squares  indicates  the  correspond¬ 
ing  set  of  t  observations  is  the  most  influential. 

The  importance  of  knowing  the  influential  set  of  observations  at  the 
design  stage  is  that  (1)  we  can  assess  the  influence  of  a  set  of  unavail¬ 
able  observations  in  the  planned  analysis,  (2)  in  the  case  of  deficit  of 
budget  during  a  long  term  experiment  using  the  robust  design  where  it  may 
be  a  good  idea  not  to  collect  observations  which  are  least  influential. 

2.  First  Method 


We  denote  the  ith  set  of  t  observations  in  7  by  and  the  re- 

,<i>. 


maining  observations  in  y>  by  71  ,  the  corresponding  submstrices  of  X  by 

and  X^^;  the  resulting  design  when  t  observations  in  the  ith  set 
are  unavailable  by  i=l,...(^).  The  least  squares  estimators  of  J3 

under  d  and  d(i)  are  8  =  (X’X)-1X'y_  and  S  (i )  =  (X^1  *  ' Xj(1 )  *  . 


We 


(i) 


A  A 


write  the  fitted  values  of  7  under  d  and  d  as  7^  =  3^  and  7 


X6d •  When  t  observations  in  the  ith  set  are  unavailable,  the 


(i) 


predicted  values  of  unavailable  observations  ^  from  available  observa- 


*(i )  (i)  * 

tions  are  the  elements  in  =  ^  of 


es 


timators  can  be  judged  by  v(y^  )  =  °2X21  (xi  Xj  )  X2  •  The 


(i)  . 


first  measure  of  influence  of  72  is  defined  as 


-"JV®  s'  viv  S"  Vl  S*  S  O  >0  "  K.*  •  /V  *  »  *  »  ’  7  *  »  \ 


* 

I 
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II^0)  “  Trace  v(^1})  .  (4) 

The  smallest  value  of  Ti(x^^)»  i=l>...,(^)>  for  i=u,  Indicates  that  the 
uth  set  of  t  observations  is  the  least  influential  in  terms  of  precise 
prediction  of  unavailable  observations.  On  the  other  hand  the  largest 
value  of  Ii(yii))»  i-i». ••»(£).  for  i=w,  indicates  the  wth  set  of  t 
observations  is  the  most  influential. 

We  denote  B  =  1^  +  (xj1^  *X^  )_1X^  ' .  It  can  be  checked 


that 


B(i)  =  lt  ”  X21)(X'X>~1  X21}'» 


V)  '  ®d  =  (X’X) 


•i  v(i)'  -  (y(i)g  _  (1 ) 1 

X2  B(i)lX2  Bd  *2 


EfX(i)B  -  v(1)lfx(1)fi  -v(i)l’  =  oV1 

E  X2  4  *2  JlX2  4  *2  J  _°B(i)- 


(5) 

(6) 
(7) 


We  denote  the  ith  observations  in  by  y^  and  the  It h  row  in  X  by  x^, 

i=l , . . . ,N. 

Theorem  1  For  any  design. 


(i) 


x^(X'X)  1xi 


ie{ilf...,itJ  1  -  xJ(X'X)  2^ 


(8) 


where  the  rows  of  X  are  rows  of  X^*\ 

Proof.  It  follows  from  and  B^  given  in  (5)  that  for 


i  =  i, 


1  +  *i  lXx  xi  J  x  > - _!  * 

^  l-xj(X'X)  V 

"~1  ~~± 


i  •  6  t  j 


l-x.'(X'X) 


X;(x’x)  lx± 

H(X'X)  1x1 


The  rest  is  easy 


Theorem  2  If  for  a  design,  the  individual  observations  are  equally 


influential  then 


W  '  (Wp)  •  <9) 

Proof.  When  the  individual  observations  are  equally  influential,  I^(y^) 

is  a  constant  independent  of  i  for  t=l  and  thus  and 

(x^^  X^*^)  is  3  constant  independent  of  i.  This  in  turn  implies 

from  (5)  that  for  t=l,  x!(X'X)  *x.  is  a  constant  independent  of  i. 

— i  1 

We  know  trace  X  (X'X)  ^ X'  =  p  and  thus  x^(X'X)  ^x^  =  ~  .  From  (5),  we 

get  x.'(x^)  xf^)  ^x.  =  7\r- -v-  and  hence  the  result. 

°  — iv  1  1  J  — 1  (N-p) 

Theorem  3  If  for  a  design,  the  individual  observations  are  equally 


influential,  then 


i. hi1’)  >  ftfev  • 


c  (N_p)  * 

Proof.  For  t=l  and  equally  influential  individual  observations, 
x.'(X'X)  *x,  =  and  hence  the  result  in  (10)  follows  from  (8). 

From  (9)  and  (10),  we  observe  that  for  a  design  with  equally  influential 
individual  observations  IjQr^M  2  c  I  ^  ( y ^  )  . 

3.  Second  ffethod^ 

Let  z((N-p)xN)  be  a  matrix  such  that  Rank  Z  =  (N-p),  ZX  =  0  and 

ZZ'  =  I.  It  can  be  seen  that  Cov(y^,Zy)  =  0.  This  implies  that  has 

the  minimum  variance  within  the  class  of  all  unbiased  estimators  of 

E(y^).  under  (1-3).  When  t  observations  in  the  ith  set  are  unavailable, 

A(i)  (i)A 

the  least  squares  fitted  values  are  y,  =  X,  8  , .  We  denote  the 

1  d  ( i ) 

submatrices  of  Z  corresponding  to  xj^  and  X^^  by  zj^  and  Z^  ^ .  It 
follows  that  Cov  (yj1)  ,Zy  )  =  o2 [x^1 ^ (x^1 ^ ' X^1 ) )“ 1 X^1 ^ ' Z^1 ^ '  ] .  The 
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further  Cov(x.l*^»ZyJ  is  away  from  the  null  matrix,  the  more  influential 
is  the  set  of  t  observations  We  thus  have  the  second  measure  of 


influence  as 


*2(x!^^)  =  0  ^  [Sum  of  squares  of  elements  in  Cov(^^\z^r)]  .  (11) 


We  now  show  some  similarities  between  our  two  measures  of  influence 


Il(yi1))  and  I2(Z21^)* 

Theorem  The  following  is  true  for  i=l,...,l  J, 


v(zillYi  J)  =  v 


Proof.  We  observe  that  z(1)xji)  +  Z^X^  =  0  and  hence  v(z(1i^li^)  = 

o2z(1i)x(1l)(x^l)'x(1l)  =  o2z^i)x^i)(x(1i)'x(11)  )_1x^i),z^i)'  = 


vtz^z^). 


Theorem  5  The  following  in  true. 


I2(Z21;)  =  Trace  vCz^Y1^)* 


Proof.  It  can  be  seen  that 


I2(X2(1))  =  a2  Trace  xj1  *  (xj1 5  '  X^i)  ]  1 X^1  * '  Z^1  *  '  Z^M*  *  (x^ )  '  x(i)  ) 
=  a2  Trace  zii)x(li)(xi1),x5i))  1xi1),z5i)' 

=  Trace  v(zj1Y1^) 

=  Trace  vfz^Y^) 


Corollary  We  have 


I2 CA* ) )  =  Trace  [V (jJ1 } )  ]  [zi1 5  ’  Z^1 >  ] . 


The  equation  (13)  displays  the  similarity  between  two  measures  of 


influence  I\  and  12^X2^  ^ )  •  Although  the  matrix  Z  is  not  unique,  it 

can  be  checked  that  I2(x2^)  Is  unique  for  all  choices  of  the  matrix  Z. 


ww 


? 

r. 


I 
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4.  Relationship 


Cook  (1977)  proposed  a  distance  function  between  y  and  y 

d  d  if) 


popular  as  Cook's  distance,  at  the  inference  stage  as 


^  a  ^ 

;re  (N-p)s^  =  (y.  “  y^)'(y.  ~  y^)*  ^he  ^00^'6  distance  measures  the 


degree  of  Influence  of  t  observations  In  the  1th  set  on  the  estimation  of 


3.  We  now  show  that  our  first  measure  of  Influence  Is  fn  fact 


related  to  . 


Theorem  6  From  (4)  and  (15),  we  have 


E(psdDi) =  xi(4  )• 


Proof.  We  get  from  (6) 


fy  -y  )  *  ( y  -y  )  =  -y  (  1  }  )  '  [  B 2  -R  1fX(i)  3  -y  (  1  }  )  . 

lZd(i)  ydj  l7d(i)  ydJ  1  2  -  y-2  J  1  (1)  (i)JL  2  -  d  y2 


It  now  follows  from  (7),  (15)  and  (17)  that 


E(ps^Di)  =  o2  Trace  (B(i)"Jt) 

=  a2  Trace  xil)(xii),x{1))  1x£i*' 


This  completes  the  proof. 


5.  Examples 


Consider  a  2  factorial  experiment  In  a  completely  randomised  set  up 


and  suppose  the  elements  in  $  are  the  general  mean,  the  main  effects  and 


the  2-factor  interactions.  The  3-factor  and  higher  order  interactions 


are  assumed  to  be  zero.  Thus  p=ll.  The  treatments  are  denoted  by  (x  i  x2 


X3  X4),  x ^=0 , 1 , 1=1 , 2 , 3 , 4 .  For  brevity,  we  indicate  a  treatment  by  the 


.  _  - ,  *  -  -  ,  _  .  .  «  •  .  ■ « *  ■  y  y  y  y_  <*_  ***.''".  ^ •  "*V„-  .F  . '  ■%y 1 .  v.*  F  -  _ 

'^v  .v/' j’’ j1* v  - - - - 


positions  where  the  level  1  is  occuring.  For  example,  the  treatment 
(1100)  is  denoted  by  12.  The  treatmerit  (0000)  is  denoted  by  6. 
Design  I 


Con 

sider 

a  de 

sign 

with 

15  t 

reatments  and 

we  write 

the  t 

reatments  in 

the 

orde 

r  (0,1 

,2,3 

,4,12 

,13, 

14,23 

,24, 

34, 

123,124, 

134, 

234) 

.  Note  that 

the 

elements 

in  Z, 

the 

rows 

of 

X  and 

the 

columns 

of 

Z  corre 

spond 

to  this 

orde 

ring 

.  The 

mat 

rix  Z 

is 

given 

bel 

ow 

4a 

3a 

-3a 

-3a 

-3a 

2a 

2a 

2a 

2a 

2a 

2a 

-a 

-a  -a 

-a 

0 

1 

-1 

1 

-1 

0 

-2 

0 

0 

2 

0 

1 

-1  1 

-1 

z=(. 

25) 

0 

1 

-1 

-1 

1 

0 

0 

-2 

2 

0 

0 

-1 

1  1 

-1 

0 

1 

1 

-1 

-1 

-2 

0 

0 

0 

0 

2 

1 

1  -1 

-1 

where  a  =  (l//5].  The  design  is  robust  against  the  unavailability  of  any 
two  observations  [Ghosh  (1979)]. 

Table  I 

Influences  of  Individual  Observations  Under  Design  I 


-2, 

-2  _ 

Observat ions 

o  I1 

o  I2 

yi 

4.000 

.800 

y1,i=2,... ,11 

2.333 

.700 

y1,i=12,...,15 

4.000 

.800 

•'iV/*  i 


Table  II 


Influences  of  Pairs  of  Observations  Under  Design  I 


Observations 


(y1,y1)»i=2>... , 15;(y2,y15); 


(y3’y14);(y5’y12^y6’yi^i=2, 13; 


Cy7,yi)i=12, IA;(y8,y1)i=13,14; 


(y9»yi)i=12. 15;  (y10»y1)i=13»15‘> 


Cy  1 1  ,yi  )i  =  1/4 ,  !5 . 


Cyi  ,yi)i==6 , . . . ,  1 1 ;  <y2»yi  )i  =  12»  13» 


(y3»yi)i=12,13, 15; (y4,y1)i=12, 14, 15; 


(y5.yi)i  =  13, 15; (y6,yi  )i  =  14, 15; 


(y7  >yi)i  =  13, 15; (yg,y1  )i  =  12, 15; 


(y9.yi)i=13,lA;(y10,y14); 


(yll,yi)i=12,13 


(yl»yl)i  =  l2, -  -  - » !5; 


(yJ2*yi)i=13>14»15; 


^3^)1  =  14,15; 


(y14*y15) 


11.333 


8.000 


8.667 


'  '  . 


R 


rW 


Table  II  (Continued) 


Influences  of  Pairs  of  Observations  Under  Design  I 


Observations 
^>1=3, 4, 5, 9, 10, 11; 

(y3,yi)i=4, 5,7,8,11  ’ 
^y4,yi)i=5,6,8,10; 

(y5 ,yi )i^6 , 7 ,  9 ; 

(y6,yi^i=? ,8,9, 10; 
<y7»yi)i=i8,9,ll;(y8,yi)i  =  10,ll; 

(yg»yi)i=1°.1l;(y10,y11). 

(y2>yi)i=6*7*8;(y3>yi)i=6>9» 10 ; 

Cy4 ,y± )i =7 , 9 , 1 i ; 

(y  5  »yi  )i  =8 , 10 ,  i  l ; 

(y6,yll),(y7,y10); 

(y8,y9),(y10,y12} 


-2 

o 


I 


1 


-2 

o 


I2 


4.857 


1.400 


10.000 


1.400 


We  find  that  under  Design  I,  any  of  y  ,i=2,...,ll  is  the  least 

-2 

influential  w.r.t.  both  Ij  and  12-  Any  pair  of  observations  with  0  Ij 

equals  11.333  is  the  most  influential  w.r.t.  I;.  On  the  other  hand,  any 

-2 

pair  of  observations  with  a  I2  equals  1.600  is  the  most  influential 
w.r.t.  12*  The  variability  in  values  of  I2  is  so  small  that  it  is  very 
hard  to  assess  the  influence  w.r.t.  I2  under  this  design. 


Design  II 

We  consider  a  complete  2  factorial  experiment  with  treatments 


written  in  the  order 

(1234, 

0,  i. 

2,3 

,4,12 

,13, 

14, 

23,24 

,34 

,123 

,124 

,134 

> 

234).  The  matrix 

Z 

is  as  follow 

‘  1  1 

-1 

-1  -1 

-l 

1 

1 

1 

1 

1 

1 

-1 

-1 

-1 

-1 

0  0 

1 

-1  1 

-l 

0 

-2 

0 

0 

2 

0 

1 

-1 

1 

-1 

Z~( .25) 

0  0 

1 

-1  -1 

l 

0 

0 

-2 

2 

0 

0 

-1 

1 

1 

-1 

0  0 

1 

1  -1 

-l 

-2 

0 

0 

0 

0 

2 

1 

1 

-1 

-1 

2  -2 

1 

1  1 

l 

0 

0 

0 

0 

0 

0 

-1 

-1 

-1 

-1 

This  design  is  robust  against  the  unavailability  of  any  three  observa- 

t  i  ons  [Ghosh  (  1979)  ].  It  can  be  checked  that  I  ^  (y .  )  -  2.2  a2  and  I^y^) 

2 

(.6875)0  for  i=l,...,16.  Thus  the  individual  observations  are  equally 

influential  w.r.t.  both  Ij  and  l2.  for  any  pair  of  observations 

corresponding  to  treatments  with  zero  or  three  levels  in  common,  the 
-2 

value  of  o  Ij  is  8.000.  For  every  other  pair  of  observations,  the 
_2 

value  of  o  Ij  is  A. 667.  We  therefore  see  the  validity  of  the  equation 

2 

(10)  in  Theorem  3  for  this  example  since  2  I^(y^)  =  4.4o  .  The  value  of 
-2 

O  l2  for  any  pair  of  observations  is  a  constant  1.375.  The  remark  on 


12  for  Design  I  also  holds  for  Design  II 
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